Similarity-Based Alignment and Generalization
نویسندگان
چکیده
We present a novel approach to learning predictive sequential models, called similarity-based alignment and generalization, which incorporates in the induction process a specific form of domain knowledge derived from a similarity metric of the points in the input space. When applied to Hidden Markov Models, our framework yields a new class of learning algorithms called SimAlignGen. We discuss the application of our approach to the problem of programming by demonstration–the problem of learning a procedural model of a user’s behavior by observing the interaction an application GUI. We describe in detail the SimIOHMM, a specific instance of SimAlignGen that extends the known Input-Output Hidden Markov Model (IOHMM). We use the SimIOHMM in empirical evaluations that demonstrates the dependence of the prediction accuracy on the introduced similarity bias, as well as the computational gains over the IOHMM.
منابع مشابه
A generalization of Profile Hidden Markov Model (PHMM) using one-by-one dependency between sequences
The Profile Hidden Markov Model (PHMM) can be poor at capturing dependency between observations because of the statistical assumptions it makes. To overcome this limitation, the dependency between residues in a multiple sequence alignment (MSA) which is the representative of a PHMM can be combined with the PHMM. Based on the fact that sequences appearing in the final MSA are written based on th...
متن کاملSimilarity-Based Alignment and Generalization: A New Paradigm for Programming by Demonstration
We present an approach to learning procedural knowledge by demonstration called similarity-based alignment and generalization. Key to our approach is the ability to induce complex procedure structure (loops and conditional branches) by aligning multiple unannotated demonstrations of a procedure. We present an implemented instance of a similarity-based alignment and generalization algorithm that...
متن کاملHybrid DNA Sequence Similarity Scheme for Training Support Vector Machines
Similarity between two DNA sequences is based on alignment. There are different approaches of alignments; each has its own specialty of bearing different information on DNA sequence. This paper presents a study on similarity kernels based on different similarity schemes and proposes a hybrid one. Similarity Kernel is required in order to represent the distance or similarity between two DNA sequ...
متن کاملIdentification of BKCa channel openers by molecular field alignment and patent data-driven analysis
In this work, we present the first comprehensive molecular field analysis of patent structures on how the chemical structure of drugs impacts the biological binding. This task was formulated as searching for drug structures to reveal shared effects of substitutions across a common scaffold and the chemical features that may be responsible. We used the SureChEMBL patent database, which prov...
متن کاملBayesian Alignment of Similarity Shapes.
We develop a Bayesian model for the alignment of two point configurations under the full similarity transformations of rotation, translation and scaling. Other work in this area has concentrated on rigid body transformations, where scale information is preserved, motivated by problems involving molecular data; this is known as form analysis. We concentrate on a Bayesian formulation for statisti...
متن کامل